INFORMATION 
RETRIEVAL


A Critical View 


Edited by GEORGE SCHECTER 


Based on Third Annual Colloquium 
on Information Retrieval 
May 12-13, 1966, Philadelphia, Pennsylvania 


Sponsored by 


Special Interest Group on Information Retrieval, 
Association for Computing Machinery 


Delaware Valley Chapter, Association for 
Computing Machinery 


Institute of Electrical and Electronics Engineers
Computer Group, Philadelphia Section


United States Army Frankford Arsenal





Moore School of Electrical Engineering,
University of Pennsylvania


1967


THOMPSON BOOK COMPANY


Washington, D. C.


ACADEMIC PRESS


London


GETTING IT OUT OF OUR SYSTEM 


THEODOR H. NELSON
Harcourt, Brace & World, Inc. 
New York, New York 


SUMMARY 


Specialist access and muster of information, usually the underlying task
of “information retrieval,” may not be best accomplished either by
indexing techniques (document retrieval) or queriable information networks
(content retrieval). Digital text storage and display make possible the
creation of at least one potentially powerful medium: the hypertext, or
non-linear text system. As a medium, not a facility, it will share traits
of book and film. Machines will present complex interconnected text units
to users who will weave their way through them at a glowing screen.
Successive user choices will evoke successive presentations which have
been created (written and structurally designed, and signed) by
individuals and teams. Departing completely from the convention of linear
text as we know it, the hypertext will probably be much more effective for
everything.

Several parallels exist between the (potential) development of the
hypertext medium and the actual development of the motion picture in its
early years: the inventors of the technology underestimated the importance
and generality of the resulting medium; techniques of predecessor media
were imported to it, and the techniques and structure peculiar to it took
some years to develop; the character of its units was not at once
discovered; and credit to a work's creators was for a time thought
unimportant. Films had one special advantage: a standardized technical
context existed to make the exchange of units simple.


INTRODUCTION 


What do we want to get out of an information system? There is a great
need to turn the new machines (computers, mass memories, and displays)
into information systems, or, to be more exact, storage and supply
systems (as distinct from systems for data processing). But how are they
to work, and what are they to do?

What do we want to get out of an information system? The way we answer
this rhetorical question probably settles the major aspects of the system
we design. The hard-nosed answer is “Documents.” The dreamy-eyed answer
is “Pure information.” The hesitant answer is to ask another question,
“What sorts of things is the system for?” Let us settle this sub-question
neutrally. If we try to think in terms that avoid the commitments of
information retrieval, perhaps we can find a new way of seeing the problem
context, and better understand the conventional approaches for what they
are and do.

What do we want to get out of an information system? The information
problem I will confront here is that of how to keep specialists informed
and updated. To give it an acronym, I would venture that what we want is
Specialist Access and Muster of Information (SAMI). To speak of “access
and muster” avoids the connotations of retrieval: that the information has
been lost, prodigal, or shot in the bushes. (We will exclude other
possible meanings of the term “information retrieval,” consider library
inventory systems to be a temporary expedient, and management information
to be a job for real-time systems.) I will propose that SAMI is best
served, not by document retrieval or content retrieval, each in its way
more conventional than it looks, but by something else again. This paper
is an elaborate conjecture to which I hope to give verisimilitude by an
extended historical analogy.


“INFORMATION RETRIEVAL”


Information retrieval in the present sense now means two things. One,
document retrieval, uses standardized indexing techniques to draw desired
classes of unit documents from a central pool. The other, content
retrieval, holds a body of integrated knowledge to which the user may
address questions. (For a united treatment of the two types, see
Licklider.1) Each is important.







A. DOCUMENT RETRIEVAL 


Let us examine the first meaning of information retrieval, document
retrieval (DR), the finding of documents having certain indexed
descriptions. In document retrieval we index a collection of documents by
particular terms or aspects, and later only get things back through those
terms. This approach retains the unit of the discrete “document,” taking
it as given. The documents themselves are not expected to have any
provision for the system, or to have been created with any relation to the
system in mind, but are produced by the world according to external
causes. The document is integrally sacrosanct, its content and form
untouched by the system. No tampering, excerpting or postediting of the
documents is permissible, no matter how useful it might be.

The problem of the system then becomes finding “the right” documents,
which the user recognizes by criteria of his own. The match between the
user's private criteria and the available indexing is indicated by the
number of “hits,” that is, documents the user is pleased with.

Studies of library use have indicated that not very much library use could
be aided by conventional document retrieval systems as currently
construed. This would be all right if we could be sure they were good for
something else.

That is one criticism of document retrieval: it is not yet clear that
document-retrieval systems will do what the users need to have done, even
in some ordinary sense, and before we start reforming the users.

The second criticism of document retrieval is based on the nature of the
indexing, or categorization, process. (Indexing is categorization, even
when it becomes complicated with the relational tagging of roles and
links.)

The best categories, and indices, decay. In principle any indexing
categories will progressively decline in relevance. It is quite reasonable
to expect that as emphasis and interests change in a field, many or most
indexing categories for information retrieval will gradually fall apart. A
cursory examination of the Dewey Decimal system should confirm this point.
If the structure of a particular field strays from the way the original
indexing saw it, we will look in the wrong place for the information that
was put away earlier.


B. CONTENT RETRIEVAL 

The second sense of information retrieval now being used is content
retrieval (CR). Content retrieval is often called “fact retrieval”; a
system for content retrieval is sometimes called the QAS, or
question-answering system. (An excellent source on CR is Simmons.2)










Such systems can be seen as having two parts: some overall method of
storing a complex network of interrelated information, such as that of
Marill,3 and a program mechanism for following through its different parts
to find something out. Network and query programs are intertwined; the
design of one has consequences for the other. However, for the present
purposes we may call such a system the “queriable information network,”
since the information network is the basic part: in principle a way can
presumably be found to interrogate any adequate logistic system.

Content retrieval discards the unit of the document. The total contents 
of a queriable information network need not be divided into units at all, 
except those described in the information and those required to represent it.

Such systems offer great promise in a number of ways. However, they do
have present and future drawbacks. First, whether such systems can replace
significant amounts of library use cannot be ascertained. Second,
preparing information to go into the queriable information network may be
an extremely difficult, touchy and expensive business, at least as things
look now. For some systems, coders may require training in mathematical
notations of great complexity. Third, conversing with such a machine will
on some systems usually, and in the most lucid systems sometimes, require
formulating questions in arcane languages or dialects. Either these can be
known only to specialists, or their use by the laity may lead to mistakes.

The major problem, though, and the real long-term drawback, is analogous
to the decay of categories and indices in DR. Content retrieval assumes
that all the information in the network is consistent and true, and has no
inconvenient gaps. Like any overview, CR must ignore uncertainties and
lacunae, and (as “knowledge” is revised and interests shift) it may leave
only the tintype of a certain moment's concerns. As scientific and other
beliefs are constantly changing, “knowledge” must frequently be readjusted
or rearranged.

Actually, the idea of “knowledge” is misleading in this context. What we
regard as knowledge includes various contradictory sets of interwoven
propositions, clusters of argument, and discrepancies of faith, taste and
emphasis in the literature. There is usually disagreement in every field
of knowledge. This means that in a queriable information network,
different and mutually contradictory theories really ought to be able to
coexist within the system.

Moreover, we may surmise that in most fields of knowledge some
non-controversial information is incorrect, but we don't know which. It
follows that unless the information network can be modified as systems of
information are modified, the information network will be left behind.
While evolutionary and patching modifications might be arranged to cope
with the representation of changing views, this expands the problem
significantly. (For more on such evolutionary issues and difficulties, see
Nelson.4)


MANIFESTO 


I would like to make the claim that document and content retrieval systems
are not the principal nor the basic use of computers for handling ideas.
I am not claiming that such systems are useless or improper. Rather, I
think they will be helpful accessories to the proper, natural and true use
of computers for handling the written word.

Systems thinkers have proposed a great variety of possible arrangements
for ordering man's thoughts and aiding his mind with the computer. I would
like to propose that this diversity of possible facilities and functions
is in fact comprisable into an organized text medium, to be created and
used by people and for people. This organized medium will be that of the
text structure created expressly for on-line display by the computer. Such
a text will be read from an illuminated screen, the cathode-ray display;
it will respond or branch upon actions by the user. It will be a
succession of displays, largely textual, that come and go according to his
actions. Such a congeries of interconnected text may incorporate and have
available many chunks of literature on a given subject, or many subjects,
purposely assembled and woven together by authors and editors. The name
for this new medium is hypertext.

Hypertext is the combination of natural-language text with the computer's
capacities for interactive, branching or dynamic display, when explicitly
used as a medium. Or, to define it more broadly, “hypertext” is the
generic term for any text which cannot be printed (or printed
conveniently) on a conventional page, or used conveniently when bound
between conventional covers. “Non-linear text” might be a fair
approximation. Hypertext may differ from ordinary text in its sequencing
(it may branch into trees and networks), its organization (it may have
multiple levels of summary and detail), its mode of presentation (it may
contain moving or manipulable illustrations, moving or flashing
typography), and so on. At last, with the appearance of the computer
display and appropriate back-up systems, such texts are possible and
practical.

We will store natural-language texts in digital devices for dynamic
display, and arrange them into new kinds of units. This means discarding
customary document boundaries, and organizing the materials in a complex
and non-linear fashion. The user will read from a screen, and as he goes
he will make choices that call connected materials into view. Hypertext
will be a medium, not a facility, and the institutions of authorship,
citation, academic commentary and argument will be preserved and
incorporated in an orderly fashion.

Thousands (or millions) of paragraphs of source material may be available
directly to the inquiring reader who wishes further detail, or a certain
person's comment, or a specific idea or sentence. Moreover, the original
context of such items may be summoned, and such contexts (articles, books,
or other hypertext units) themselves examined. Thus the reader in search
of understanding may continually uncover new elements of interest, often
of growing relevance to his hunches, his confusions and his unphrased
questions. He can find explanations, ideas, or whole new areas that are to
his interest. He may study them through further excerpts, reviews,
summaries, or even conventional reading.

The reader will explore, light pen in hand, browsing or studying closely,
choosing paths he prefers, criss-crossing these paths on summary levels,
stepping from section to section and stopping for deeper work in units of
interest. He will come to understand the whole, or a general section, like
a walker exploring a city.

Hypertexts will require no special training to use, and rather little
special training to create, though like works in other media they can be
made either well or poorly. To use, or read, a hypertext, you will need
mainly to sit at a screen, read, and indicate choices by pushbutton or
light pen. To create a hypertext, you will type in new text and indicate
the new relational connections (and their operational meaning) to the
system. Where you wish to establish fixed sequences or give a test, you
will also state what user's action is to evoke what presentation.
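
The authoring step just described, in which the creator states what user
action is to evoke what presentation, can be pictured as a small table of
choices attached to each display. The Python sketch below is only an
illustration under assumed names (Presentation, follow, and the unit
labels are all hypothetical); it does not describe any actual system.

```python
from dataclasses import dataclass, field


@dataclass
class Presentation:
    """One display of text plus the choices its author attached (hypothetical)."""
    text: str
    # Maps a user action (a button label, say) to the name of the next display.
    choices: dict = field(default_factory=dict)


def follow(presentations, start, actions):
    """Return the names of the presentations evoked by a series of user actions."""
    path = [start]
    current = start
    for action in actions:
        # An action the author made no provision for leaves the display unchanged.
        current = presentations[current].choices.get(action, current)
        path.append(current)
    return path


# Illustrative content only.
presentations = {
    "overview": Presentation("Summary of the field.",
                             {"detail": "source", "comment": "review"}),
    "source": Presentation("Original report.", {"back": "overview"}),
    "review": Presentation("A signed commentary.",
                           {"back": "overview", "detail": "source"}),
}
```

Calling follow(presentations, "overview", ["comment", "detail", "back"])
traces one reader's route through the three displays. A fixed sequence is
simply the case where each display offers a single choice.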

There is reason to suppose that hypertext systems, once created, will
serve a range of purposes. These will include the informing of
professionals, education, briefings, giving directions, and even providing
leisure reading. In the present SAMI context, this generality is not
accidental but necessary. While we will not discuss hypertexts for
education in this article, we will discuss the need for this generality in
satisfying the customary purposes of information retrieval.

Of course it is risky to speculate on what the particular forms and uses
of hypertext will be, since its actual presence and demonstration should
swiftly outstrip speculation; until then we can have little feel for it.
For any but conventional text, we have little notion of the comparative
sense or utility of different sequences and structures. All that we can
tell at this time is that there are many possibilities. Until the money is
spent to try it, speculation is all we can do. But let us do that.




GENERAL TEXT STRUCTURES 


Let us consider in the abstract some of the ways that hypertext may be
organized. We will assume the use of some machine capable of presenting
the texts in any arrangement, sequence or response network that we wish
to plan. What ways are there? (To save space and accentuate continuity
with the present, we will refer only to “text structures,” recognizing
that arrangements which depart from the printable collectively deserve the
term “hypertext.”)


A. THE UNITS

Text structures are composed of units. These units may be for convenience
(the page of typescript) or have some meaning (the paragraph, the book).
The units may be composed of other units (as chapters are made out of
paragraphs) and may be combined to form larger units (as chapters make up
a book).
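
The nesting of units within units can be shown with a short recursive
structure. This is a minimal Python sketch under assumed names (Unit, its
fields, and the sample book are all hypothetical), meant only to
illustrate that one kind of object can stand for a paragraph, a chapter or
a book.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Unit:
    """A text unit that may contain smaller units (hypothetical structure)."""
    kind: str                      # for example "book", "chapter", "paragraph"
    text: str = ""
    parts: List["Unit"] = field(default_factory=list)

    def count(self, kind: str) -> int:
        """Count units of the given kind in this unit and everything below it."""
        own = 1 if self.kind == kind else 0
        return own + sum(part.count(kind) for part in self.parts)


def paragraph(text: str) -> Unit:
    """Convenience constructor for the smallest unit used in this example."""
    return Unit("paragraph", text)


# Illustrative content only: a book of two chapters and three paragraphs.
book = Unit("book", parts=[
    Unit("chapter", parts=[paragraph("First."), paragraph("Second.")]),
    Unit("chapter", parts=[paragraph("Third.")]),
])
```

Nothing in the structure forces the parts into a reading order beyond the
order of the list, which leaves room for the non-linear connections taken
up in the text.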

It is important to note that we may discard conventional document
boundaries without discarding the notion of unit organization; organized
units of text need not have linear sequence, as they do usually. Moreover,
organizing text into units need not restrict the types of interconnection
among these units that may be tried. We may indeed define unit boundaries
in hypertexts, to give creators and users a sense of orientation, and to
facilitate administration.


B. TYPES


Let us then consider different types of texts. Acknowledging that this
taxonomy is quite tentative and uncertain, I will try to enumerate some
characteristics that texts of all kinds can have. The object is to learn
what traits will distinguish the different kinds of text (especially,
hypertext) from one another.

Collection vs. Composition. We may distinguish text structures according
to the way in which their units are created. We may distinguish between
collected texts, which are assembled by some method or rule, and composed
texts, which are put together with consideration of some of their overall
properties by some human agency. As examples of collected text we may take
a full mail box, a library, or an issue of The New York Times; as examples
of composed texts we may consider most books or poems, or the front page
of The New York Times.

Sequentiality. The next obvious distinction is that of sequentiality. The
texts may be sequential, that is, have a specific order of presentation,
or structural, with the reader free to move among the parts as he sees
fit. (Books as we know them are of both kinds, and there are mixed cases
as well; again, the newspaper.)

Connective Structure. The connective relations of texts are yet another
basis for distinguishing among them. There may be many different patterns
of discrete connections among minimal text segments. Similarly, between
the larger text units, which may contain many pieces, there may be many
degrees of overlap, interconnectedness and penetration. Print technology
has limited our experience of text interconnection, but there are simple
conventional examples. These include parallel texts (commentaries,
parodies) and hooked-on materials (the footnote, the illustration).

Hierarchical relations may also exist among text structures. That is, some
units may be “over” others. By convention this hierarchical relation has
to do with conceptual generality in the higher unit. However, hierarchy
might take on other meanings. Even for the existing “generality”
convention, lattice or other connective structures are possible, where the
“more general” units could be for different contexts and settings.

The hypertext will have other attributes, based neither on units nor
connectedness. These we may deem (depending on various things) to be part
of “the user interface,” at the option of the text's creator, or left to
the decisions inherent in an administrative setup. They include the user's
freedom to read or use anything contained in the text complex; his ability
to take notes, trace his wanderings, or otherwise to couple the use of the
hypertext with his own library system; and permission to contribute to the
text.

The supporting facility that presents the hypertext must also have,
besides displays and calls, indexing, and the ability to cross-follow from
one part to another, such other functions as word searches and counts.
(However, note that the provision of service functions that speed up
look-ups and cross referencing should not be confused with the hypertext
medium.) It may also usefully include such complex indexing and queriable
information structures as are now associated with DR and CR, just as books
now contain indexes.


SUPPLANTING IR WITH HYPERTEXT


The problem is to answer questions and update specialists; that is, to
facilitate specialist access and muster of information (SAMI).

Let us consider here a particular text structure for SAMI purposes, the
specialist (or scholarship) hypertext. Let me offer speculations about what
the specialist hypertext will be like.


The hypertext for these purposes will probably be a particular kind of
anthology. It will be a repository hypertext, continually enlarged and
intermittently edited. Its constituent materials will be from time to time
assembled into composed units; for instance, a collection of source texts
in a given area of discourse. This collection will be connected with many
links to specially-composed review articles, arranged in various strata of
summary, detail and commentary. By reading in one of these master texts,
and turning to original materials or other instructional sections, the
user may keep abreast of a field or instruct himself in an unfamiliar
area.

The creation and maintenance of these specialist hypertexts will be a
cyclical process. First the “original” units or documents will be put in;
these will correspond to reports and articles as they are now
conventionally published. So assembled, they will constitute a collected
hypertext, structured only by minor indexing: cross-referenced, say, by
chronology, authorship and topic. These source materials will remain in
the system.

From time to time these source materials will be assembled into composed
units; without, however, losing anything in structure or independence,
since they will continue to be separately addressable. The new composed
units will be created by author-editors, like anthologies and review
articles. These author-editors will write overviews, summaries and other
passages of their own, create connections, make excerpts and digest
versions, and add clarifications, arguments, comparisons, diagrams and
miscellany.

The units will be signed, like review articles, and thus linked to the
professional prestige of their authors. Short squibs and even connective
relations may be attributable in this way to particular persons. The
authorship will consist of both writing new material and making
connections and arrangements among parts of what existed before. It
follows that readers who know something may contribute materials. This
requires an appropriate administrative context. Some people might need to
have their work reviewed, others not.
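
The two properties stressed above, that source units stay separately
addressable and that both units and connections carry a signature, can be
sketched as a small repository. The following Python is a hedged
illustration only; the class names, the addresses and the sample authors
are hypothetical, and review or permission rules are left out entirely.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SignedUnit:
    """A unit of text with its own address and a named author (hypothetical)."""
    address: str
    author: str
    text: str


@dataclass(frozen=True)
class SignedLink:
    """A connection between two units, itself attributed to a person."""
    source: str
    target: str
    author: str


class Repository:
    """Holds units and links; nothing is overwritten once put in."""

    def __init__(self):
        self.units = {}
        self.links = []

    def add_unit(self, unit):
        if unit.address in self.units:
            raise ValueError("address already in use: " + unit.address)
        self.units[unit.address] = unit

    def connect(self, source, target, author):
        # Both ends must already exist, so every link stays resolvable.
        for address in (source, target):
            if address not in self.units:
                raise KeyError(address)
        link = SignedLink(source, target, author)
        self.links.append(link)
        return link

    def linked_from(self, address):
        """Return the units reachable in one step from the given address."""
        return [self.units[l.target] for l in self.links if l.source == address]


# Illustrative content only.
repo = Repository()
repo.add_unit(SignedUnit("report-1", "A. Researcher", "Original findings."))
repo.add_unit(SignedUnit("review-1", "B. Editor", "Overview citing the report."))
repo.connect("review-1", "report-1", author="B. Editor")
```

Here the review and the link are both credited to the author-editor,
while the original report remains intact under its own address and its own
author's name.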

Such a corpus will have new research results, discussion, and
interjections continually added to it by identified individuals. It will
be continually enlarged, restructured, reconstructed. Periodically the
overview-articles will be updated or replaced, with the old ones kept
available for the record. Materials of fading interest will not be
destroyed; it will just take longer to get to them, and the newcomer will
hear of them less.

Dissents may be incorporated. The system imposes no assumption of
completeness or infallibility on its constituent parts. Inconsistencies
will be welcome; and with all sides of any argument immediately available,
the totality of interrelations among divergent viewpoints will be sooner
understood.

Such an approach reflects the real structure of scientific and academic
fields. The text network of current interest will grow and change, rather
than, in its fixity, depreciating in relevance as a remembrance of the way
thinking went at a particular time.


REMARKS 


Curiously, the common view of the information system appears to take it
for granted that mechanization will mean depersonalization. “Naturally
such a machine will not be popular among authors since by and large no
reference will be made to the originator of any given idea. But, on the
other hand, perhaps this will be all to the good since it will avoid the
hard feelings which are so often produced at the present time by
inadequate credit.” There is no need for such dehumanizing; it will work
better the other way. The hypertext approach ties in with existing human
institutions and motivations. We must discard the idea that efficient
mechanization requires arbitrary truncation.

It may be complained that we should eliminate the vagaries and biases
that individuals composing hypertexts will introduce. To this I would say
that vagaries and biases which are signed by individuals, and for which
they are responsible, are far preferable to those that are silently
designed right into the system's heart.

To be useful, the hypertext medium requires some degree of prose
discursiveness in the material, or variety in the ways and sequences that
the same material can be connected together or explained. Indeed, these
texts may be made big and diverse enough for study by specialist and
beginner alike, with many entrances, tracks and specially-oriented
meanders. The user's previous background and level of knowledge could be
taken beneficially into account by the author-editors.

It might be claimed that facilities of such a general character are not
necessary for the purposes conventionally served by information retrieval.
The purpose of retrieval systems is to inform the technical user of what
he needs to know, giving short replies to discrete specific questions.

This may sometimes be the case. When technical queries seek only to fill
the blanks in some frozen context, DR or CR will do. Often the user has
all the background he needs, and no long explanation is needed as an
explanatory frame for the short reply. But even the specialist cannot
always ask specific questions or understand short answers, let alone
cryptic titles and summaries of borderline literature. For such needs
these systems really ought to be ready to teach in the larger sense, even
when talking to professionals. SAMI systems need the backup capacity for
general instruction or review. And so the function of the hypertext, even
in the SAMI context, will be somewhere between “teaching” and “giving
information.” (How often there seems no feasible practical distinction
between these two.)

Such systems will not, of course, replace all information retrieval
systems. Where a fixed-context retrieval structure safely tells the user
all he needs to know, as does a train schedule, nothing further is
required.


UNDERSTANDING THE MEDIUM 


Computer branching operations, text storage and display determine this
medium. If this is the case, why is the medium taking so long to be
developed? I claim that our problem has been not knowing, in some
important sense, what the things are for, even though these devices have
been with us for some fifteen years.

If complex text display and handling is (or determines) a medium, why
doesn't everybody know it?

There are two answers to this, which complement each other. 

The first is that various people do know it, have grasped the medium's
character. The second is that there is no reason to expect everybody to
catch on to a new medium right away. Let me discuss both of these
assertions.

To predict that we will have non-linear text media is not very original.
Similar ideas have been expressed by many people, beginning with Vannevar
Bush, if we do not mention the Oz books and other immemorial sources.
Marshall McLuhan has said it in a very general way: this “mosaic” medium
is another step in the transition to organic thinking, learning and
society that he predicts.9 The suggestion here encompasses views expressed
by Harold Wooster,10 H. Bohnert and M. Kochen,11 T. M. Williams,12 and (in
different places) by several contributors to the Intrex Planning
Conference. Bush's prediction was the most thorough. But someone
identified only as “Our Special Correspondent” also made the point rather
precisely in the Times Literary Supplement, May 4, 1962, when he (?) said
the scholar of the future would “do more than simply manage linear
prose. . . .” Rather, he would “cut it like a film or compose it like a
picture.”


CULTURAL LAG 


Sociologically, we may regard information retrieval (DR, CR) as a somewhat
conservative response by the professional community to various
technological innovations. It is a time-honored sociological theory
(advocated chiefly by W. F. Ogburn14) that practices take time to catch up
to inventions. This is called “cultural lag,” the delay between a change
in society (especially a technological change) and the culture's catching
up by squaring related practices with the change. Confusion and
inefficiency follow the change until the society finds appropriate ways to
deal with it.

For reasons both personal and corporate, the irruption of technology in
information handling could not be ignored. It had to be assimilated.
“Information retrieval” continued the things that had been done before,
extending the indexing and abstraction processes. Document retrieval is an
advanced card file; queriable information networks are advanced reference
charts.


THE MOTION PICTURE ANALOGY 


I would like to draw an analogy between hypertext and the motion picture.
By considering the time and trouble it recently took to understand the
nature of the motion picture, another prepared temporal and visual medium,
we may see better why things are difficult now.

Motion pictures are of interest to us in several contexts: in the
development of their elements and techniques; in the development of
organized units, especially units which are effective and viable; and in
the development of the signatory property, the ability to be signed. These
comparisons are suggestive analogies rather than close parallels. Let me
recite some highlights from the history of motion pictures, so that the
sorts of development that occurred, and what was on the minds of its
developers, can enlighten our present. (This will concentrate on the
photoplay; let it be understood that other uses of the film are
derivative.)15

The technology of the motion picture was not suddenly created in the shop
of any one inventor. However, Edison's invention (first tried with film in
1889) was the one that took. He called it the Kinematograph. It permitted
fast viewing of successive frames on a long strip of film. Edison did not
work on a projector because he and his advisors thought it would destroy
the novelty. He also failed to patent his machine abroad. In 1895 Thomas
Armat made a working projector for Edison's film. About the same time, the
brothers Lumière, photographic manufacturers in France, created an
all-purpose movie machine: the Cinématographe, which could serve by turns
as camera, printer and projector. They promptly took it on the road as a
novelty, showing films of local scenes, the arrival of trains and the
marching of troops.

The next year, 1896, a real showman began making movies in France. His
name was Georges Méliès. A magician and theater operator, he at once saw
the potentiality of the medium for presenting the astonishing and
fantastic. He made many science-fiction and magic shorts. Méliès
originated fades, dissolves, animation, and the whole range of trick
photography and multiple exposures. He is also credited with the invention
of editing. With Cinderella in 1900 Méliès is said to have produced the
first story film.16

Meanwhile, films in America were already popular, having been intro- 
duced to vaudeville in 1896. Credit for being the first “theatrical film” is 
usually given to Edwin S. Porter’s The Great Train Robbery (1903). 
However, Porter's The Life of an American Fireman, produced the year 
before, had contained all the same elements.

A curious monopoly circumstance, with interesting consequences,
gradually came about in the American motion picture industry. Because
various companies had tried to circumvent the Edison movie patents, there
were many lawsuits over the process around the turn of the century. The
legal tangle was cleared up by embracing the contending film companies
into a monopoly. This monopoly, the General Film Trust, controlled all film
production and distribution in America under the umbrella of the Edison
patents. For a brief time (1909 to 1915) this trust produced films like
sausages, undifferentiated, with the names of the makers and actors not
revealed. The officers of this trust preferred that the works they produced
be marketable as an interchangeable commodity of a single length (one
reel) and quality (mediocre).

D. W. Griffith, the founder of the motion picture art and perhaps the
greatest film director of all time, began directing anonymously during this
period. In his early work with the Biograph Company, beginning in 1908,
he experimented in every possible way with the construction of scenes and
narratives, and soon surpassed every other director. Between 1908 and
1915 he invented and consolidated modern camera technique, modern
editing technique, and modern narrative technique. By the end of this period
his name was known. It was he that consciously began constructing the
film out of shots rather than “scenes.” Griffith’s The Birth of a Nation,
released in 1915, was a “total” film the way none had ever been. Uniting all
the techniques that had come before, all his new effects of camera
placement, cross-cutting, and tempo of movement and editing, he wove
history, battles and love story into a single, entire fabric. Audiences were
stupefied. Woodrow Wilson and Lenin were both deeply moved by the film.

Thus the twenty years from 1895 (Armat and Lumière machines) to
1915 (The Birth of a Nation) saw the development of a complete medium,
in its scope, techniques, and units. No one caught onto it overnight. Nobody
saw at the outset what it was going to become.

Now let us examine selected aspects of this brief history. The
correspondences to the present situation may be evident.







A. THE MEDIUM'S MISUNDERSTOOD CHARACTER


If we consider the way some of the early things were done, it is by no
means obvious that people working in the film at that time had the faintest
idea of what was to come. Indeed, it seems that at least some of the film’s
originators did not know what they were doing or had done. Edison did not
want to develop a projection system for his Kinematograph because he
thought the novelty would wear off. And when Méliès asked to buy the
Cinematographe, Auguste Lumière told him: “It can be exploited for a
certain time as a scientific curiosity, but, apart from that, it has no
commercial future whatsoever.”

Through a misunderstanding of the medium, and a failure to recognize
its character, film-makers repeatedly imported to it the techniques of the
stage. While stage content, when adapted, has worked, stage techniques
have not worked in films, and have given satisfaction only for short periods
of time.

The use of flat backdrops, action in a plane facing the camera, and the
continuous take throughout a whole scene were all early conventions
abstracted from the stage. The dramatic scene of early films was called the
“action tableau,” but it had relatively little action. Its flat arrangement was
a simple transposition from the stage, and its slight motion a prudent
simplification of ordinary staging. This was the technique of Méliès, who
also used painted backdrops shamelessly. Though the action tableaux,
after Porter, became infused with more and more action, heavy stage
influence remained. Even Porter was not completely won over by his own
inventions. Though he had pioneered three-dimensional action and stirring
visual effects in The Great Train Robbery, when he filmed Uncle Tom's
Cabin he used tableaux and flat scenery.

Stage plays have often been filmed as is; this occasionally happens even
in the present day. It does not work very well. In 1907 the Film d'Art
Company, in France, was established to uplift the motion picture by filming
plays. Their product, while liked for being cultured, was not very good.
In 1912 Adolph Zukor's Queen Elizabeth opened. It was an attempt to
immortalize Sarah Bernhardt on film. It made money. It also showed rather
clearly that the sorts of posture, makeup, movement and declamatory style
thought appropriate for the stage were not at home in the new medium.

But Griffith clearly saw the differences between stage and film. He was
asked whether he thought knowledge of the stage was necessary to direct
films. He said,

“No, I do not. . . . The stage is a development of centuries, based on
certain fixed conditions and within prescribed limits. It is needless to point
out what these are. The moving picture, although a growth of only a few years,
is boundless in its scope and endless in its possibilities. . . . The conditions
of the two arts being so different, it follows that the requirements are
equally dissimilar. . . .”19


B. TECHNIQUES 


There is a basic roster of techniques of film. These are effects
(stop-motion, animation), display arrangements (camera angles and
movement), and the selection and interconnection of parts (direction,
editing).

These techniques of film are basic and perfectly known to those who
make films, though there is disagreement on style and appropriateness. All
the techniques are familiar. Nothing is new, in the sense that it is impossible
today to create new elements. There has been virtually nothing new in the
last 50 (fifty!) years, since, say, Griffith’s Intolerance (1916).* But these
techniques of film were by no means obvious. It took twenty years of
film-making to discover them, from 1895 to 1915. Though many of the films
have been lost, in principle each innovation has associated with it a specific
film, director and date. The various techniques were introduced one by one,
chiefly by Méliès, Porter and Griffith.

Early movie-makers did not use more than one shot in a scene. They
did not move the camera. They instructed their actors to wave their hands
around and exaggerate every gesture, since the camera was so far away.
And the camera was so far away because film-makers thought it looked
wrong not to show the actor's whole body. Other rules included:

“1. Every scene must begin with an entrance and end with an exit, just 
as on the stage. 

“2. Players must face the camera and move horizontally, except when
the movement was rapid, as in a chase, or prolonged, as in a fight. At these
times action was in diagonal relation to the camera in order to give the
players more area.

“3. Any action in the background must be slow and greatly exaggerated
so that it would ‘register’ on the audience.

“4. Pantomime must be exaggerated and over-deliberate, e.g., a stare
had to be held; a start must be violent; speeches must be mouthed with
pronounced slowness.”20


*Sound, color, etc., did not modify these techniques; but they added
separate problems of their own and additional opportunities for synthesis.


C. UNITS AND THEIR ORGANIZATION


The character of basic units, in particular the narrative film, was not at
first grasped. “In 1896, a special outdoor performance by Joseph Jefferson
of Rip Van Winkle amidst the natural scenery of his summer home on
Buzzards Bay, Massachusetts, was recorded by Biograph on 219 feet of
film. But instead of shooting this subject with a view toward an interesting
and a dramatic continuity, it was filmed in eight episodes billed and sold as
separate catalogue numbers. . . .”21

For a while, projected films were timed to the vaudeville turn, and then
to the organization and taste of the nickelodeon theater. Though films
tended naturally to become longer, after about 1908 the General Film
Trust held the length of all Trust films to one reel (1000 feet, about
sixteen minutes). It was not until the decline of the Trust that the American
film could break out of this one-reel mold and attain feature length.

Not only was the overall unit misunderstood; the parts themselves were
not thought to be manipulable and discrete. The basic unit was thought to
be the “scene.” It was only D. W. Griffith who learned to work in terms of
shots rather than complete scenes, and to construct the scenes from the
ground up, using the shot as the building block. Griffith discarded the
principle that a scene must end before a new shot may begin, and, indeed,
the principle that one scene must end before another begin.22 By
manipulating the constituent parts he learned to handle the motion picture
as a whole, controlling and integrating all its aspects.

The climax, of course, was Griffith’s The Birth of a Nation. Though
panoramic spectacle with three-dimensional sets had been used before,
especially in the Italian Quo Vadis, it had not previously been integrated
with a well-told story. The Birth of a Nation was such a complete, unified,
spectacular achievement: the true feature film, and a model seldom
equalled.


D. THE SIGNATORY PRINCIPLE


“Films are made by men, not machines or corporations.”23 Yet
interestingly enough, the notion that films should be signed, that is, have
personal credit go to their creators, was not acknowledged for a time. Quite
the contrary: credit was for a time suppressed.

The film trust's management of the motion picture industry from 1909
to 1915 produced an even stream of one-reel films, whose actors and
directors were not known to the public. The public was developing
favorites, and demanding their names, like that of the girl everyone knew as
“Little Mary.” The exhibitors, too, wanted the names so they could in some
measure control public response, i.e., book the favorites and advertise.
But the favorites’ names were kept a secret. (The film trust's policy of not
releasing the names of actors was ably met abroad by the simple expedient
of assigning names to them.24)

The cat got out of the bag in 1910. Carl Laemmle of the trust-defying
Imp Company publicly advertised the name of a film actress for the first
time, after he had stolen her from a trust studio. Thus began the star
system. Competition from Laemmle and other outlaw producers forced the
trust companies to reveal names. (“Little Mary” found it expedient to
adopt the name of Mary Pickford.) Griffith’s name was first advertised
when Biograph gave in to the new open-names policy in 1913.


IMMANENCE


I have attempted to point out some curious similarities between a
general-purpose medium we all know and a general-purpose medium that
none of us, to my knowledge, has seen.

The comparison would be strongest, of course, if we based it on a premise
of structural determinism. This would be to assert that media like the
hypertext and film derive their character from intrinsic principles and
internal dynamics that are somehow built right into them.

This was the view held by Kuleshov, the great Russian teacher of film.
He said: “In every art there must be first a material, and secondly, a method
of composing this material specially adapted to this art.”25 As the reader
sees, it took several decades to discover these for the film.

Perhaps in general we have no leeway, then, and media are ours merely
to discover rather than to invent. But I am not sure that for the hypertext
it is this tight. We can in hypertexts establish precedents and traditions,
doctrines and schools; we can, by stress upon programmed instruction,
flash card effects or civil liberties, influence for worse or better its
usefulness to our society and condition.

The medium has an internal dynamic, I think, immanent and intrinsic
in the technology. This we must discover. But however compelling this
internal dynamic may be, there will be play for some adjustments. We must
strive to add the touches that suit it best to the way we want to do things.


A DISANALOGY: STANDARDIZATION 


By luck, movies are standard. The width and sprocketing of Edison's
original films became and remained the 35mm motion picture standard.
While there are other standards for other purposes (8mm, 16mm) and
other rival processes (3-D, introduced in 1922, and such other
intermittent specials as wide-screen and smell track), in general, a movie
is a movie.*

In order to try out the new medium, other than haphazardly,
standardized settings must be created, so that work in this medium can be
used and passed around. We need standardized character-sets and connector
notation, standard ways of scaling down a presentation to a sparse facility
and of interfacing it through varying screens, buffers and accessories. This
might be accomplished by the specification of standard facility levels, so the
creators of such things could either stipulate a minimum system required
for use of a given text, or an appropriate presentation level for each
configuration of hardware.
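
As a rough modern illustration of the facility-level idea (a sketch only,
not anything specified in this paper), the following assumes three
hypothetical levels and a hypothetical helper, choose_presentation. A text
stipulates a minimum level, and the helper picks the richest prepared
presentation that a given hardware level can support. All names and level
descriptions here are assumptions made for the example.

```python
from dataclasses import dataclass, field
from enum import IntEnum
from typing import Dict, Optional


class FacilityLevel(IntEnum):
    """Hypothetical standard facility levels, ordered from sparse to rich."""
    SPARSE = 1
    STANDARD = 2
    FULL = 3


@dataclass
class Hypertext:
    """A text whose creators stipulate a minimum facility level."""
    title: str
    minimum_level: FacilityLevel
    # Prepared presentations, keyed by the level each one was designed for.
    presentations: Dict[FacilityLevel, str] = field(default_factory=dict)


def choose_presentation(text: Hypertext, hardware: FacilityLevel) -> Optional[str]:
    """Return the richest presentation the hardware supports.

    Returns None when the hardware falls below the stipulated minimum,
    or when no presentation was prepared for a level the hardware can handle.
    """
    if hardware < text.minimum_level:
        return None
    usable = [level for level in text.presentations if level <= hardware]
    if not usable:
        return None
    return text.presentations[max(usable)]


demo = Hypertext(
    title="demo",
    minimum_level=FacilityLevel.STANDARD,
    presentations={
        FacilityLevel.STANDARD: "text only",
        FacilityLevel.FULL: "text with diagrams",
    },
)
print(choose_presentation(demo, FacilityLevel.FULL))
```

The ordering of levels carries the "scaling down" described above: a
presentation prepared for a lower level remains usable on richer hardware,
never the reverse.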

Moreover, it will be necessary to establish reliable royalty provisions
for authors, editors and contributors, probably based on the presentation
frame. But something equivalent to ownership of a copy will also be
necessary, so that a given user may have unlimited access to his own
possessed material.


CONCLUSION 


Information retrieval has extended traditional techniques to new
technology. An analogy has been found in the history of motion pictures. The
prior techniques of the theater were at first extended to movies, and it was
several decades before we understood the motion picture as a medium. It
was as natural to extrapolate these prior techniques to the movie as it now
seems natural to extrapolate prior techniques of information handling to
the CRT-computer-mass memory system. Here again, as in films, the
technology makes possible a medium which is radically new and has dynamics
of its own. The most suitable techniques and units were not immediately
evident in movies, either.

The conventional “document” is not God-given, and in fact is
inappropriate for most purposes. Systems based on discrete and isolated
documents relinquish the greatest power of the new technology. But at the
other extreme we are wrong to suppose that an information machine can or
should eliminate the human task of composition. We have not begun to
explore the possibilities of natural language woven into more complex (but
also natural) arrangements.


The problem of Getting It Out of Our System, then, is not the problem
of fishhook design for a document pool, or of creating a conversational
black box with a narrow vocabulary. We must get out of our system the
fixities of thinking and procedure that hold us back.

*It is important to note that facilities are readily available for the
transposition of off-standard work to the main standard, by optical printing
and sound transfer. Often novelty effects offered in first-run theaters are
scaled down to the standard for distribution.


CITATIONS 




1. J. C. R. Licklider, Libraries of the Future. M.I.T. Press, 1965.

2. R. F. Simmons, “Answering English Questions by Computer,”
Communications of the ACM 8:1 (January 1965), 53-69.

3. Thomas Marill, “Libraries and Question-Answering Systems.” Report
Number 1071, Bolt Beranek and Newman, Inc.

4. Bert F. Green, Alice Wolf, Carol Chomsky & Kenneth Laughery,
“Baseball: an Automatic Question-Answerer” in E. A. Feigenbaum
and J. Feldman (eds.), Computers and Thought. McGraw-Hill,
1963, 207-216.

5. Theodor H. Nelson, “A File Structure for the Complex, the Changing
and the Indeterminate.” Association for Computing Machinery
Proceedings of the 20th National Conference, 1965, pp. 84-100.

6. Theodor H. Nelson, “The Hypertext.” Paper presented at 1965
Congress of the International Federation for Documentation, Washington,
D.C., October 1965.

7. Andrew W. Booth, “The Kinds of Machine Now in Use” (No. 4 in
the series “Freeing the Mind”), The Times Literary Supplement,
April 13, 1962, p. 252.

8. Vannevar Bush, “As We May Think.” Atlantic Monthly 176:1
(July 1945).

9. Marshall McLuhan, Understanding Media: the Extensions of Man.
McGraw-Hill, 1965.

10. Harold Wooster, “A Web of Words.” Keynote Address of Sixth
Annual Institute in Technical and Industrial Communications,
Colorado State University, Fort Collins, Colorado, July 1963.

11. H. Bohnert and M. Kochen, “The Automated Multilevel Encyclopedia
as a New Mode of Scientific Communication,” in Manfred Kochen,
Some Problems in Information Science. Scarecrow Press, 1965,
156-160.

12. T. M. Williams, “Some Logical Aspects of Thematic Representation
with Explicit Ties.” Paper presented at 1965 Congress of the
International Federation for Documentation, Washington, D.C., October
1965.

13. Carl F. J. Overhage and R. Joyce Harman, Intrex: Report of a
Planning Conference on Information Transfer Experiments. M.I.T. Press,
1965. Two notes by J. C. R. Licklider are of special interest: “An
On-Line Information Network,” 147-155, and “Proposed Experiments
in Browsing,” 187-197. See also E. S. Proskauer, “The Motivations
of Authors: Intellectual Property and the Computer,” 199-201.

14. Our Special Correspondent, “Poetry, Prose and the Machine” (No.
7 in the series “Freeing the Mind”), The Times Literary Supplement,
May 4, 1962, p. 310.

15. William F. Ogburn, “Cultural Lag as Theory.” Sociology and Social
Research 41, 167-174. Jan-Feb 1957.

16. Richard Griffith and Arthur Mayer, The Movies. Bonanza, 1957,
p. 6.

17. Maurice Bardèche and Robert Brasillach, The History of Motion
Pictures (tr., ed. Iris Barry), W. W. Norton Co. and the Museum of
Modern Art, 1938, p. 10.

18. A. Nicholas Vardac, Stage to Screen: Theatrical Method from
Garrick to Griffith. Harvard University Press, 1949, p. 171.

19. Lewis Jacobs, The Rise of the American Film: a Critical History.
Harcourt, Brace, 1939, pp. 118-119.

20. Jacobs, op. cit., p. 60.

21. Vardac, op. cit., p. 100.

22. Ernest Lindgren, The Art of the Film. London: Allen and Unwin,
1948.

23. Iris Barry, quoted by Richard Griffith in “Moving Pictures,”
Encyclopedia Americana, 1960 edition, v. 19, p. 540.

24. Griffith and Mayer, op. cit., p. 47.

25. Jacobs, op. cit., p. 88.

26. This is Pudovkin's reminiscence, cited by Lindgren, op. cit., p. 76.




